Review of Modern Logistic Regression Methods with Application to Small and Medium Sample Size Problems

نویسندگان

  • Enes Makalic
  • Daniel F. Schmidt
چکیده

Logistic regression is one of the most widely applied machine learning tools in binary classification problems. Traditionally, inference of logistic models has focused on stepwise regression procedures which determine the predictor variables to be included in the model. Techniques that modify the log-likelihood by adding a continuous penalty function of the parameters have recently been used when inferring logistic models with a large number of predictor variables. This paper compares and contrasts three popular penalized logistic regression methods: ridge regression, the Least Absolute Shrinkage and Selection Operator (LASSO) and the elastic net. The methods are compared in terms of prediction accuracy using simulated data as well as real data sets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Penalized Lasso Methods in Health Data: application to trauma and influenza data of Kerman

Background: Two main issues that challenge model building are number of Events Per Variable and multicollinearity among exploratory variables. Our aim is to review statistical methods that tackle these issues with emphasize on penalized Lasso regression model.  The present study aimed to explain problems of traditional regressions due to small sample size and m...

متن کامل

Sample size determination for logistic regression

The problem of sample size estimation is important in medical applications, especially in cases of expensive measurements of immune biomarkers. This paper describes the problem of logistic regression analysis with the sample size determination algorithms, namely the methods of univariate statistics, logistics regression, cross-validation and Bayesian inference. The authors, treating the regr...

متن کامل

بکارگیری روش باز نمونه گیری بوت استرپ در رگرسیون لجستیک و کاربرد آن در تحلیل داده های مربوط به بیماران مبتلا به سرطان سینه

Background and Aim: The purpose of this study was to assess the accuracy of the bootstrap method in logistic regression and to explore the method's use in logistic regression models in cases where the sample size is insufficient. Materials and Methods: We use data from 150 patients who had undergone surgery at the Cancer Institute, Emam Khomeini hospital during from 1999 to 2001. Then we drew...

متن کامل

FUZZY LOGISTIC REGRESSION: A NEW POSSIBILISTIC MODEL AND ITS APPLICATION IN CLINICAL VAGUE STATUS

Logistic regression models are frequently used in clinicalresearch and particularly for modeling disease status and patientsurvival. In practice, clinical studies have several limitationsFor instance, in the study of rare diseases or due ethical considerations, we can only have small sample sizes. In addition, the lack of suitable andadvanced measuring instruments lead to non-precise observatio...

متن کامل

Substitution of inorganic fertilizers with organic manure reduces nitrate accumulation and improves quality of purslane

Growers often apply high amounts of chemical fertilizers for vegetable production and this application contributes to concerns about nitrate levels in food. An experiment was conducted to investigate soil N amendment effects for reducing the nitrate accumulation and improving the quality of fresh purslane (Portulaca ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010